WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 : 
C07K 14/435, G01N 33/533 



Al 



(11) International Publication Number: WO 00/08054 

(43) International Publication Date: 17 February 2000 (17.02.00) 



(21) International Application Number: PCT/GB 99/02596 

(22) International Filing Date: 6 August 1999 (06.08.99) 



(30) Priority Data: 

9817225.7 
9817227.3 
9817229.9 



8 August 1998 (08.08.98) GB 
8 August 1998 (08.08.98) GB 
8 August 1998 (08.08.98) GB 



(71) Applicant (for all designated States except US): IMPERIAL 

CANCER RESEARCH TECHNOLOGY LIMITED 
[GB/GB]; Sardinia House, Sardinia Street, London WC2A 
3NL (GB). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): BASTIAENS, Philippe 
[NL/GB]; 134a Royal College Street, Camden, London 
NW1 OTA (GB). PEPPERKOK, Rainer [DE/DEJ; Hohl 
Strasse 19, D-68809 Neulussheim (DE). GELEY, Stefan 
[AT/GB]; 11 Frampton Road, Potters Bar, Herts EN6 1JF 
(GB). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CR, CU, CZ, DE, DK, EE, ES, FI, GB, 
GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, 
KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, 
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, 
SK, SL, TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZA, 
ZW, ARIPO patent (GH, GM, KE, LS, MW, SD, SL, SZ, 
UG, ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD, 
RU, TJ, TM), European patent (AT, BE, CH, CY, DE, DK, 
ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI 
patent (BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, 
NE, SN, TD, TG). 



Published 

With international search report. 



(74) Agent: MILES, John; Eric Potter & Clarkson, Park View 
House, 58 The Ropewalk, Nottingham NG1 5DD (GB). 



(54) Title: MODIFIED GREEN FLUORESCENT PROTEIN 



(57) Abstract 



A polypeptide comprising the functional portion of green fluorescent protein but wherein valine (V) (163) has been replaced with 
alanine (A), serine (S) (175) has been replaced with glycine (G), isoleucine (I) (167) has been replaced with threonine (T), phenylalanine 
(F) (64) has been replaced with leucine (L), serine (S) (65) has been replaced with threonine (T), serine (S) (72) has been replaced with 
alanine (A), and threonine (T) (203) has been replaced with tyrosine (Y). Polynucleotides encoding the protein and uses of the protein as a 
reporter molecule are also described. 
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1 

MODIFIED GREEN FLUORESCENT PROTEIN 

The present invention relates to fluorescent proteins, in particular to 
"green fluorescent proteins" (GFPs) and mutants thereof which have 
altered spectral properties compared to the wild type GFP. 

Fluorescent assays for biological systems have been used for several years 
since, compared to many biological techniques, they have the advantage 
of, in some circumstances, being able to be carried out non-invasively and 
also they are able to give real time analysis of particular reactions in 
complex biological systems in which many reactions are being carried out 
simultaneously. In conjunction with the development of physical 
techniques for fluorescent assays has been the development of biological 
reporter constructs which serve as monitors of reactions, for example, 
within a cell. In particular, the development of fluorescent proteins that 
do not require cofactors for their intrinsic fluorescence has meant that 
such proteins can be introduced into and expressed in cells via genetic 
constructs. 

Examples of intrinsically fluorescent proteins, notably so-called Aequorea 
victoria "green fluorescent proteins" or u GFPs" (although they may be 
blue or yellow) are known. Miyawaki et al (1997) Nature 388, 882-887 
describes a GFP-based Ca 2+ sensing system; Mitra et al (1996) Gene 173, 
13-17 describes a two-GFP-based system for use in identifying protease 
inhibitors; WO 97/28261 discloses a two-GFP system in which the GFP 
donor and GFP acceptor are linked by a peptide containing a protease 
cleavage site. WO 95/07463 describes uses of GFPs; WO 96/23898 



WO 00/08054 PCT/GB99/02596 

2 

relates to a method of detecting biologically active substances using GFPs; 
Heim & Tsien (1996) Current Biology 6, 178-182 relates to engineered 
GFPs with improved brightness, longer wavelengths and fluorescence 
resonance energy transfer (FRET); Poppenborg et al (1997) J. BiotechnoL 
5 58, 79-88 relates to GFPs as reporters for bioprocess monitoring; Park & 
Raines (1997) Protein Science 6, 2344-2349 relates to a GFP as a signal 
for protein-protein interactions; Niswender et al (1995) J. Microscopy 
180, 109-116 relates to quantitative imaging of GFP in cultured cells; 
Chalfie et al (1994) Science 263, 802-805 relates to GFP as a marker for 

10 gene expression; Hampton et al (1996) Proc. NatL Acad. Sci. USA 93, 
828-833 relates to the in vivo examination of membrane protein 
localization and degradation with GFP; Heim et al (1995) Nature 373, 
663-664 relates to mutant GFPs with altered fluorescent properties; 
Mosser et al (1997) BioTechniques 22, 150-161 relates to the use of a 

15 dicistronic expression cassette encoding GFP for the screening and 
selection of cells expressing inducible gene products; Suarez et al (1997) 
Gene 196, 69-74 relates to GFP-based reporter systems for genetic 
analysis of bacteria; Niedenthal et al (1996) Yeast 12, 773-778 relates to 
GFP as a marker for gene expression and subcellular localization in 

20 budding yeast; and Prescott et al (1997) FEBS Lett 411, 97-101 relates to 
the use of GFP as a marker for assembled mitochondrial ATP synthase in 
yeast. GFPs and their uses have been reviewed in Pozzan et al (1997) 
Nature 388, 834-835, Misteli & Spector (1997) Nature Biotechnology 15, 
961-964; and Cubitt et al (1995) Trends Biochem. Sci. 20, 448-455. 

25 

Despite many variant GFPs being known, there continues to be the need 
for variant GFPs which have improved properties, especially improved or 
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new spectral properties, for use in biological systems, especially those 
where fluorescence resonance energy transfer (FRET) is used to study the 
biological system. 

5 An example of a FRET-based method for studying biological systems is 
described in detail in UK Patent Application No 9817229.9 entitled 
"Fluorescent assay for biological systems " and the PCT application 
claiming priority from that application and which was filed on the same 
day as this application. A further example of a FRET-based method is 
10 described in Miyawaki et al (1997) Nature 388, 882-887. 

A first aspect of the invention provides a polypeptide comprising the 
functional portion of green fluorescent protein but wherein valine (V) 163 
has been replaced with alanine (A), serine (S) 175 has been replaced with 
15 glycine (G), isoleucine (I) 167 has been replaced with threonine (T), 
phenylalanine (F) 64 has been replaced with leucine (L), serine (S) 65 has 
been replaced with threonine (T), serine (S) 72 has been replaced with 
alanine (A), and threonine (T) 203 has been replaced with tyrosine (Y). 

20 By "functional portion" we include the meaning that it is the portion of the 
protein which, in the absence of other portions of GFP, gives rise to 
useful fluorescent properties, such as the portion being fluorescent. It will 
be appreciated that, in respect of this first aspect of the invention the GFP 
or polypeptide comprising the functional portion of GFP with the given 

25 mutations may also include other mutations which may confer further 
desirable properties. 
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A second aspect of the invention provides a polypeptide which has the 
amino acid sequence 

MGKGEELFTGWP I L VELDGD VNGQKF S VRGEGEGD ATYGKLTLKF I CTTGKL P VPWPTLVTTLTY 
GVQCFARYPDHMKR 

HDFLKSAMPEGYVQERTIFFKDDGNYKTRAEWFEGDTLVNRIELKGIDFKEDGNILGHKLEYl^ 

SHNVYIMADKQKNG 

IKANFKTRHNIEDGGVQIJU5HYQQNT^ 

AGITHGMDELYKSF 

This is the sequence shown in Figure 3. 

Conveniently, the difference with respect to the native GFP can be 
summarised as S2G, H25Q, S30R, F64L, S65T, S72A, Q80R, F84L, 
V163A, I167T, S175G, T203Y, K209Q, N212H, E213G, A216S, + 
239S + 240F, making use of standard single letter amino acid code. 

By "green fluorescent protein", in the context of a single protein, we 
mean wild-type green fluorescent protein as described in Prasher et al 
(1992) Gene 111, 229-233 and whose amino acid sequence is given in 
Figure 1. As noted above, the term GFP may be used to denote variants 
which in fact fluoresce yellow or blue. 

The sequence of a particular preferred polypeptide of the invention 
(variant GFP) is shown in Figure 3, and its sequence is compared with A. 
victoria GFP and mm GFP5 (Zernicka-Goeta et al (1997) Development 
124, 1133-1137 in Figures 4 and 5. 

It will be appreciated that the functional portion of the polypeptide which 
contains the mutations as said may be incorporated into any suitable 
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polypeptide in which it is desired to have a fluorescent moiety. Typically, 
the functional portion is included in a polypeptide whose fluorescence or 
change in fluorescence is measured under suitable conditions. Thus, the 
polypeptide may be one which is expressed as a reporter molecule (since 
5 its expression may be measured fluorimetrically). Alternatively, the 
functional portion may be included in a polypeptide which is used in a 
biological system which makes use of FRET. For example, a polypeptide 
of the invention includes a polypeptide which contains the fluorescent 
portion as said, and it is used in conjunction with another fluorescent 

10 moiety with which it acts as a donor or acceptor in a FRET reaction. 
Most suitably, the polypeptide of the invention contains, in addition to the 
functional portion as said, a further fluorescent moiety in the same 
polypeptide chain and the pair of fluorescent moieties may act as donor- 
acceptor pairs in a FRET reaction. Thus, the polypeptide of the invention 

15 typically is a fusion protein containing at least the functional portion of the 
polypeptide which contains the mutations as said. 

Thus, the polypeptide of the invention may be used in any suitable prior 
art FRET method. 

20 

The polypeptides of the invention are particularly suited for use in the 
FRET method described in UK Patent Application No 9817229.9 entitled 
"Fluorescent assay for biological systems" and the PCT application 
claiming priority from that application and which was filed on the same 
25 day as this application since the mutations confer an unusually high 
fluorescent lifetime. The polypeptides of the invention are believed to be 
particularly suited as acceptor molecules since, at least in relation to the 
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molecule of the second aspect of the invention, it excites at 514nm and 
emits at 531nm. 

A particularly preferred polypeptide of the invention is one which has the 
5 amino acid sequence as shown in Figure 3. 

A further preferred polypeptide of the invention is one which comprises at 
least residues 7 to 229 of green fluorescent protein containing said amino 
acid replacements. The minimal domain required for fluorescence in GFP 
io is believed to be amino acids 7 to 229 (Li et al (1997) /. Biol. Chem. 272, 
28545-28549. Also, this information, and other information about GFPs, 
is available from Clontech Laboratories, Inc., 1020 East Meadow Circle, 
Palo Alto, CA 94303, USA ( xqli@CLONTECH . com) . 

15 It is preferred that a polypeptide which comprises at least residues 7 to 
229 of green fluorescent protein not only contains the amino acid 
replacements described with respect to the first aspect of the invention, but 
also, compared to native A. victoria GFP, the amino acid replacements 
H25Q, S30R, Q80R, F84L, K209Q, N212H, F213G and A216S. 

20 

A still further preferred polypeptide is one comprising a further 
fluorescent moiety. In particular, the further fluorescent moiety is one 
which is capable of FRET with the said portion of the variant GFP. 
Typically, this further fluorescent moiety is a GFP or a variant GFP. 
25 Thus, a particularly preferred polypeptide of the invention is a fusion 
polypeptide which contains at least the functional portion of the 
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polypeptide which contains the mutations as said, and contains a further 
mutant GFP. 

A third aspect of the invention provides a polynucleotide encoding a 
5 polypeptide of the first or second aspect of the invention. The 
polynucleotide may be DNA or RNA; DNA is preferred. A particularly 
preferred polynucleotide of the invention is shown in Figure 3 (DNA 
sequence) but, because of the degeneracy of the genetic code, it will be 
appreciated that other polynucleotides may encode the same polypeptide 
10 (ie with the amino acid sequence given in Figure 3). 

A fourth aspect of the invention provides an expression vector encoding a 
polypeptide of the first or second aspect of the invention. 

15 The expression vectors of the invention, and other polynucleotides can be 
constructed by standard laboratory molecular biology methods such as 
those described in Sambrook et al (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, New 
York incorporated herein by reference. 

20 

The polynucleotide of the invention (typically DNA) may be expressed in a 
suitable host to produce a polypeptide comprising the polypeptide of the 
invention. Thus, the DNA encoding the polypeptide of the invention may 
be used in accordance with known techniques, appropriately modified in 
25 view of the teachings contained herein, to construct an expression vector, 
which is then used to transform an appropriate host cell for the expression 
and production of the polypeptide of the invention. Such techniques include 
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those disclosed in US Patent Nos. 4,440,859 issued 3 April 1984 to Rutter 
et al 9 4,530,901 issued 23 July 1985 to Weissman, 4,582,800 issued 15 
April 1986 to Crowl, 4,677,063 issued 30 June 1987 to Mark et al, 
4,678,751 issued 7 July 1987 to Goeddel, 4,704,362 issued 3 November 
5 1987 to Itakura et al, 4,710,463 issued 1 December 1987 to Murray, 
4,757,006 issued 12 July 1988 to Toole, Jr. et al, 4,766,075 issued 23 
August 1988 to Goeddel et al and 4,810,648 issued 7 March 1989 to 
Stalker, all of which are incorporated herein by reference. 

10 The polynucleotide, such as DNA, encoding the polypeptide of the 
invention may be joined to a wide variety of other DNA sequences for 
introduction into an appropriate host. The companion DNA will depend 
upon the nature of the host, the manner of the introduction of the DNA into 
the host, and whether episomal maintenance or integration is desired. 

15 

Generally, the DNA is inserted into an expression vector, such as a 
plasmid, in proper orientation and correct reading frame for expression. If 
necessary, the DNA may be linked to the appropriate transcriptional and 
translational regulatory control nucleotide sequences recognised by the 

20 desired host, although such controls are generally available in the expression 
vector. The vector is then introduced into the host through standard 
techniques. Generally, not all of the hosts will be transformed by the 
vector. Therefore, it will be necessary to select for transformed host cells. 
One selection technique involves incorporating into the expression vector a 

25 DNA sequence, with any necessary control elements, that codes for a 
selectable trait in the transformed cell, such as antibiotic resistance. 
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Alternatively, the gene for such selectable trait can be on another vector, 
which is used to co-transform the desired host cell. 

Host cells that have been transformed by the recombinant DNA of the 
5 invention are then cultured for a sufficient time and under appropriate 
conditions known to those skilled in the art in view of the teachings 
disclosed herein to permit the expression of the polypeptide, which can then 
be recovered. 

10 Many expression systems are known, including bacteria (for example E. 
coli and Bacillus subtilis), yeasts (for example Saccharomyces cerevisiae), 
filamentous fungi (for example Aspergillus), plant cells, animal cells and 
insect cells. 

15 The vectors include a prokaryotic replicon, such as the ColEl ori, for 
propagation in a prokaryote, even if the vector is to be used for expression 
in other, non-prokaryotic, cell types. The vectors can also include an 
appropriate promoter such as a prokaryotic promoter capable of directing 
the expression (transcription and translation) of the genes in a bacterial host 

20 cell, such as E. coli, transformed therewith. 

A promoter is an expression control element formed by a DNA sequence 
that permits binding of RNA polymerase and transcription to occur. 
Promoter sequences compatible with exemplary bacterial hosts are typically 
25 provided in plasmid vectors containing convenient restriction sites for 
insertion of a DNA segment of the present invention. 
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Typical prokaryotic vector plasmids are pUC18, pUC19, pBR322 and 
pBR329 available from Biorad Laboratories, (Richmond, CA, USA) and 
p7rc99A and pKK223-3 available from Pharmacia, Piscataway, NJ, USA. 

5 A typical mammalian cell vector plasmid is pSVL available from 
Pharmacia, Piscataway, NJ, USA. This vector uses the SV40 late promoter 
to drive expression of cloned genes, the highest level of expression being 
found in T antigen-producing cells, such as COS-1 cells. 

10 An example of an inducible mammalian expression vector is pMSG, also 
available from Pharmacia. This vector uses the glucocorticoid-inducible 
promoter of the mouse mammary tumour virus long terminal repeat to drive 
expression of the cloned gene. 

15 Useful yeast plasmid vectors are pRS403-406 and pRS413-416 and are 
generally available from Stratagene Cloning Systems, La Jolla, CA 92037, 
USA. Plasmids pRS403, pRS404, pRS405 and pRS406 are Yeast 
Integrating plasmids (Yips) and incorporate the yeast selectable markers 
HIS3, TRP1, LEU2 and URA3. Plasmids pRS413-416 are Yeast 

20 Centromere plasmids (YCps). 

A variety of methods have been developed to operably link DNA to vectors 
via complementary cohesive termini. For instance, complementary 
homopolymer tracts can be added to the DNA segment to be inserted into 
25 the vector DNA. The vector and DNA segment are then joined by 
hydrogen bonding between the complementary homopolymeric tails to form 
recombinant DNA molecules. 
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Synthetic linkers containing one _or more restriction sites provide an 
alternative method of joining the DNA segment to vectors. The DNA 
segment, generated by endonuclease restriction digestion, is treated with 
5 bacteriophage T4 DNA polymerase or E. coli DNA polymerase I, enzymes 
that remove protruding, 3' -single-stranded termini with their 3 '-5'- 
exonucleolytic activities, and fill in recessed 3 '-ends with their polymerizing 
activities. 

10 The combination of these activities therefore generates blunt-ended DNA 
segments. The blunt-ended segments are then incubated with a large molar 
excess of linker molecules in the presence of an enzyme that is able to 
catalyze the ligation of blunt-ended DNA molecules, such as bacteriophage 
T4 DNA ligase. Thus, the products of the reaction are DNA segments 

15 carrying polymeric linker sequences at their ends. These DNA segments 
are then cleaved with the appropriate restriction enzyme and ligated to an 
expression vector that has been cleaved with an enzyme that produces 
termini compatible with those of the DNA segment. 

20 Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including International 
Biotechnologies Inc, New Haven, CN, USA. 

A desirable way to modify the DNA encoding the polypeptide of the 
25 invention is to use the polymerase chain reaction as disclosed by Saiki et al 
(1988) Science 239, 487-491. 
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In this method the DNA to be enzymatically amplified is flanked by two 
specific oligonucleotide primers which themselves become incorporated into 
the amplified DNA. The said specific primers may contain restriction 
endonuclease recognition sites which can be used for cloning into expression 
5 vectors using methods known in the art. 

The present invention also relates to a host cell transformed with a 
polynucleotide vector construct of the present invention. The host cell can 
be either prokaryotic or eukaryotic. Bacterial cells are preferred 

10 prokaryotic host cells and typically are a strain of E. coli such as, for 
example, the E. coli strains DH5 available from Bethesda Research 
Laboratories Inc., Bethesda, MD, USA, and RR1 available from the 
American Type Culture Collection (ATCC) of Rockville, MD, USA (No 
ATCC 31343). Preferred eukaryotic host cells include yeast and 

15 mammalian cells, preferably vertebrate cells such as those from a mouse, 
rat, monkey or human fibroblastic cell line. Yeast host cells include 
YPH499, YPH500 and YPH501 which are generally available from 
Stratagene Cloning Systems, La Jolla, CA 92037, USA. Preferred 
mammalian host cells include Chinese hamster ovary (CHO) cells available 

20 from the ATCC as CCL61, NIH Swiss mouse embryo cells NIH/3T3 
available from the ATCC as CRL 1658, and monkey kidney-derived COS-1 
cells available from the ATCC as CRL 1650. 

Transformation of appropriate cell hosts with a DNA construct of the 
25 present invention is accomplished by well known methods that typically 
depend on the type of vector used. With regard to transformation of 
prokaryotic host cells, see, for example, Cohen et al (1972) Proc. Natl 
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Acad. ScL USA 69, 2110 and Sambrook et al (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY. Transformation of yeast cells is described in Sherman et al (1986) 
Methods In Yeast Genetics, A Laboratory Manual, Cold Spring Harbor, 
5 NY. The method of Beggs (1978) Nature 275, 104-109 is also useful. 
With regard to vertebrate cells, reagents useful in transfecting such cells, for 
example calcium phosphate and DEAE-dextran or liposome formulations, 
are available from Stratagene Cloning Systems, or Life Technologies Inc., 
Gaithersburg, MD 20877, USA. 

10 

Electroporation is also useful for transforming cells and is well known in the 
art for transforming yeast cells, bacterial cells and vertebrate cells. 

For example, many bacterial species may be transformed by the methods 
15 described in Luchansky et al (1988) Mol Microbiol 2, 637-646 
incorporated herein by reference. The greatest number of transformants is 
consistentiy recovered following electroporation of the DNA-cell mixture 
suspended in 2.5X PEB using 6250V per cm at 25|aFD. 

20 Methods for transformation of yeast by electroporation are disclosed in 
Becker & Guarente (1990) Methods Enzymol. 194, 182. 

Successfully transformed cells, ie cells that contain a DNA construct of the 
present invention, can be identified by well known techniques. For 
25 example, cells resulting from the introduction of an expression construct of 
the present invention can be grown to produce the polypeptide of the 
invention. Cells can be harvested and lysed and their DNA content 



WO 00/08054 PCT/GB99/02596 

14 

examined for the presence of the DNA using a method such as that 
described by Southern (1975) J. Mol Biol 98, 503 or Berent et al (1985) 
Biotech. 3, 208. Alternatively, the presence of the protein in the 
supernatant can be detected using antibodies. 

5 

However, a convenient way of identifying transformed cells which express 
the polypeptide is that they are fluorescent. 

In addition to direcdy assaying for the presence of recombinant DNA, 
10 successful transformation can be confirmed by well known immunological 

methods when the recombinant DNA is capable of directing the expression 

of the protein. For example, cells successfully transformed with an 

expression vector produce proteins displaying appropriate antigenicity. 

Samples of cells suspected of being transformed are harvested and assayed 
15 for the protein using suitable antibodies. Of course, transformation and 

expression is indicated by the production of a fluorescent protein in this 

case. 



Thus, in addition to the transformed host cells themselves, the present 
20 invention also contemplates a culture of those cells, preferably a monoclonal 
(clonally homogeneous) culture, or a culture derived from a monoclonal 
culture, in a nutrient medium. 

A particularly suitable "starting" vector is the pcDNA3.1 vector 
25 distributed by Invitrogen (Invitrogen BV, De Schelp 12, 9351 NV Leek, 
The Netherlands). The key features of this vector for this invention are: (i) 
Cytomegalovirus enhancer-promoter for high level expression of the insert 
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in mammalian cells (the insert is for example the cDNA encoding the 
polypeptides described above and, has to be cloned into the vector); (ii) 
multiple cloning site in forward and reverse orientation; (iii) expression 
cassette for a selectable marker in eukaryotic cells (neomycin, zeocin or 
5 hygromycin). 

When the polypeptide of the invention is to be used ex vivo, such as in an 
in vitro assay or the like, it may be convenient to express the polypeptide 
in a bacterial system (such as E. coli), or in yeast or insect cells, or in 
10 other systems which have been designed for facile production of large 
amounts of protein. When the polypeptide of the invention is to be used 
in an in vivo assay it is conveniently expressed in the cell in which the 
assay is to be carried out, typically a mammalian cell. 

15 In a particularly preferred embodiment, DNA encoding the polypeptide of 
the invention (such as YFP5) may be fused to the promoter/enhancer 
elements of a gene under investigation. Such DNA stably introduced into 
mammalian cells may be used as a reporter for expression of the 
respective gene under investigation. Readout of the activity is the amount 

20 of polypeptide, such as YFP5, expressed, which can be determined by 
determination of the specific fluorescence of the polypeptide. Similar 
DNAs may be generated for the other GFPs such as those listed in Table 1 
(see below). Since they have overlapping spectra they cannot be used 
simultaneously in the same cells. However, using fluorescent lifetime 

25 imaging CFP, MmGFPS and YFP5, for example, could be used 
simultaneously as their lifetimes are sufficiently separated from each 
other. Using multiple frequency FLIM (fluorescent lifetime imaging) the 
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relative amounts of these three GFP mutants expressed in the same cell 
could be determined with high precision and hence the promoter activity 
of at least three genes. 

5 Multifrequency FLIM is described in UK Patent Application No 
9817227.3 entitled "Multiple Frequency Fluorescence Lifetime Imaging" 
and the PCT application claiming priority from that application and which 
has the same filing date as this application. 

10 The invention will now be described in more detail with reference to the 
following Figures and Example wherein: 

Figure 1 shows the cDNA and amino acid sequence of A. victoria green 
fluorescent protein (GFP). 

15 

Figure 2 shows the cDNA and amino acid sequence of a prior art mutant 
GFP (mmGFPS; Zernicka-Goetz et al). 



Figure 3 shows the cDNA and amino acid sequence of a polypeptide of 
20 the invention (called mmYFP or mYFP5 or YFP5) which is described in 
more detail in Example 1. 

Figure 4 is a comparison of the cDNA sequences from Figures 1 to 3. 



25 



Figure 5 is a comparison of the amino acid sequences from Figures 1 to 3 . 
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Example 1: Construction of mutant GFP and its properties 



The mutant GFP which we call YFP5, which is a red-shifted mutant of 
MmGFP5, was generated by PCR-mediated site-directed mutagenesis of 

5 MmGFPS (Zernicka-Goetz et al (1997) Development 124, 1133-1137). 
MmGFP5 is a wtGFP mutated in V163A, S175G, I167T, F64L and 
S65T; the mutations V163A, S175G and I167T were introduced into 
wtGFP by Siemering et al (1996) Current Biol 6, 1653-, and Zernicka- 
Goetz et al introduced the mutations F64L and S65T). This approach 

10 introduced mutations S72A and T203Y into MmGFPS using primer pairs 

ATGCGGCCGCGAATTCGCCACCATGGGTAAAGGAGAAGAACTT and 

CTGGGTATCTTGCGAAGCATTGTACGTACAATGCTTCGCAAGATACCCAG ; and 
GAAAGGGCAGATTGATAGGACAGGTAATGCATTACCTGTCCTATAATCTGCCCTT 
TC and AAGGATCCTCTAGAAGCTTTTGTATAGTTCATCCATG. The 

15 underlined nucleotides indicate mismatches. 

The fluorescent lifetimes of various GFP mutants are shown in Table 1. 
References to Table 1 

20 

1. Heim & Tsien (1996). Engineering green fluorescent protein for 
improved brightness, longer wavelengths and fluorescence resonance 
energy transfer. Curr. Bio. 6, 178-182. 



25 2. Orme M et al (1996). Crystal structure of the Aequorea victoria 
green fluorescent protein. Science 273, 1392-1395. 
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3. Zernicka-Goetz et al (1997). Following cell fate in the living 
mouse embryo. Development 124, 1 133-1 137. 

4. Miyawaki et al (1997). Fluorescent indicators for Ca 2+ based on 
5 green fluorescent proteins and calmodulin. Nature 388, 882-887. 

The final PCR product was gel-purified, digested with EcdRl and Xbal 
and subcloned into pEFT7MCS. This vector is based on pEF-BOS 
(Nucleic Acids Res. (1990) Sep 11; 18(17), 5322 pEF-BOS, a powerful 

10 mammalian expression vector. Mizushima S, Nagata S). A modified 
version of pEF-BOS containing a Neo resistance expression cassette, 
pEFl-Neo, was obtained from G. Baier, Innsbruck. The Neo expression 
cassette to make the vector smaller and introduced a T7 RNA polymerase 
promoter as well as several unique restriction enzyme sites downstream of 

15 the human EFlct promoter and the SV40 polyadenylation site. 

Any other suitable vector, as described in the specification, may be used 
for the expression of the mutant GFP. The introduced mutations were 
verified by sequencing using Sequenase. The sequence of YFP5 is given 
20 in Figure 3. 

The respective GFP mutants were expressed in cells by microinjection 
(Pepperkok et al, 1997 in "Microinjection and Transgenesis", eds. Cid- 
Arregui and Garcia-Carranca, Springer, Heidelberg, pp 145-154) of 
25 plasmids based on the vector pEFT7MCS and with inserts of the 
respective GFP encoding cDNAs. At 2h after microinjection cells were 
mounted on the FLIM microscope set-up and the respective lifetimes were 
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determined at 37 °C in living cells. Any suitable expression system or 
lifetime-detection system may be used. 



YFP5 shows a well-separated and significandy longer lifetime than other 
5 GFP mutants making it an ideal partner in multi-labelling FLIM 
experiments. 
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CLAIMS 

1. A polypeptide comprising the functional portion of green 
fluorescent protein but wherein valine (V) 163 has been replaced with 

5 alanine (A), serine (S) 175 has been replaced with glycine (G), isoleucine 
(I) 167 has been replaced with threonine (T), phenylalanine (F) 64 has 
been replaced with leucine (L), serine (S) 65 has been replaced with 
threonine (T), serine (S) 72 has been replaced with alanine (A), and 
threonine (T) 203 has been replaced with tyrosine (Y). 

10 

2. A polypeptide which has the amino acid sequence 

MGKGEELFTGWPILVELDGDVNGQKFSWGEGEGDATYGKLTLKFICTTGKXiPVPWPTLVTTLTY 

GVQCFARYPDHMKR 
15 HDFLKSAMPEGYVQERTIFFKDDGNYKTRAEVKF^ 
SHNVYIMADKQKNG 

IKANFKTRHNIEDGGVQLADHYQQNTP I GD^ 
AG I THGMDEL YKS F . 

20 3. A polypeptide according to Claim 1 which has the additional amino 
acid replacements S2G, H25Q, S30R, Q80R, F84L, N212H, E213G, 
A216S, and additional residues 239S and 240F. 

4. A polypeptide comprising at least residues 7 to 229 of green 
25 fluorescent protein containing the amino acid replacements as defined in 
any of Claims 1 to 3. 
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5. A polypeptide according to any one of Claims 1, 3 and 4 
comprising a further fluorescent moiety. 

6. A polypeptide according to Claim 5 wherein the further fluorescent 
5 moiety is a green fluorescent protein or a variant thereof. 

7. A polypeptide according to any one of Claims 1, and 3 to 6 which 
is a fusion polypeptide. 

10 8. A polypeptide according to Claim 7 wherein the fusion polypeptide 
is one used in a biological system which makes use of FRET. 

9. A polynucleotide encoding a polypeptide according to any one of 
the preceding claims. 

15 

10. An expression vector encoding a polypeptide according to any one 
of Claims 1 to 8. 

11. A host cell comprising a polynucleotide according to Claim 9 or an 
20 expression vector according to Claim 10. 

12. Use of a polypeptide according to any one of Claims 1 to 8 as a 
reporter molecule in a cell. 

25 13. Use of a polynucleotide according to Claim 9 or an expression 
vector according to Claim 8 to express a reporter molecule in a cell. 
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14. 



Any novel fluorescent protein as herein described. 
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Figure 1 

A.victoria GFP: 
cDNA: 

atgagtaaaggagaagaacttttcactggagttgtcccaattcttgttgaattagatggtgatgttaatgggcacaaatt 
ttctgtcagtggagagggtgaaggtgatgcaacatacggaaaacttacccttaaatttatttgcactactggaaaactac 
ctgttccatggccaacacttgtcactactttctcttatggtgttcaatgcttttcaagatacccagatcatatgaaacag 
catgactttttcaagagtgccatgcccgaaggttatgtacaggaaagaactatatttttcaaagatgacgggaactacaa 
gacacgtgctgaagtcaagtttgaaggtgatacccttgttaatagaatcgagttaaaaggtattgattttaaagaagatg 
gaaacattcttggacacaaattggaatacaactataactcacacaatgtatacatcatggcagacaaacaaaagaatgga 
atcaaagttaacttcaaaattagacacaacattgaagatggaagcgttcaactagcagaccattatcaacaaaatactcc 
aattggcgatggccctgtccttttaccagacaaccattacctgtccacacaatctgccctttcgaaagatcccaacgaaa 
agagagaccacatggtccttcttgagtttgtaacagctgctgggattacacatggcatggatgaactatacaaataa 

protein: 

MSKGEELFTGWPILVELIX3DWGHKFSVSGEGEGDATYGKLTLKFICTTGPCL.PVPWPTLVTTFSYGVQCFSRYPDHMKQ 
HDFFKSAMPEGYVQERTI FFKDDGNYKTRAEVKFEGDTLVNRI ELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNG 
I KVNFKI RHNI EDGS VQLADHYQQNTP IGDGPVLLPDNHYLSTQSALS KDPNEKRDHMVLLEFVTAAGITHGMDELYKZ 
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Figure 2 



mmGFP5: 
CDNA: 

at 99gtaaaggagaagaacttttcactggagttgtcccaattcttgttgaattagatggtgatgttaatgggcaaaaatt 
c tc tg tcaggggagagggtgaagg tga tgcaaca tacggaaaac t taccct taaa 1 1 1 a 1 1 tgcac tactgggaagctac 

ctgttccctggccaacacttgtcactactttgacttatggtgtacaatgcttctcaagatacccagatcatatgaagcgg 
cacgacttcctcaagagcgccatgcctgagggatacgtgcaggagaggaccatcttcttcaaggacgacgggaactacaa 
gacacgtgctgaagtcaagtttgagggagacaccctcgtcaacaggatcgagcttaagggaatcgatttcaaggaggacg 
gaaacatcctcggccacaagttggaatacaactacaactcccacaacgtatacatcatggccgacaagcaaaagaacggc 
atcaaagccaacttcaagacccgccacaacatcgaagacggcggcgtgcaactcgctgatcattatcaacaaaatactcc 
aattggcgatggccctgtccttttaccagacaaccattacctgtccacacaatctgccctttcccaagatccccacggaa 
agagagatcacatggtccttcttgagtttgttacatctgctgggattacacatggcatggatgaactatacaaaagcttc 
tag 

protein: 

MGKGEELFTGVVPILVELDGDVNGQKFS 

HDFLKSAMPEGWQERTIFFKDDGNYKTI^EV^ 
- IKJ&FK?TRHNIEIXX3VQLADHY<^^ 
* Z 



2/7 



SUBSTITUTE SHEET (RULE 26) 



WO 00/08054 



PCT/GB99/02596 



Figure 3 



mmYFP: 
cDNA: 

atgggtaaaggagaagaacttttcactggagttgtcccaattcttgttgaattagatggtgatgttaatgggcaaaaatt 

ctctgtcaggggagagggtgaaggtgatgcaacatacggaaaacttacccttaaatttatttgcactactgggaagctac 

ctgttccctggccaacacttgtcactactttgacttatggtgtacaatgcttcGcaagatacccagatcatatgaagcgg 

cacgacttcctcaagagcgccatgcctgagggatacgtgcaggagaggaccatcttcttcaaggacgacgggaactacaa 

gacacgtgctgaagtcaagtttgagggagacaccctcgtcaacaggatcgagcttaagggaatcgatttcaaggaggacg 

gaaacatcctcggccacaagttggaatacaactacaactcccacaacgtatacatcatggccgacaagcaaaagaacggc 

atcaaagccaacttcaagacccgccacaacatcgaagacggcggcgtgcaactcgctgatcattatcaacaaaatactcc 

aattggcgatggccctgtccttttaccagacaaccattacctgtccTATcaatctgccctttcccaagatccccacggaa 

agagagatcacatggtccttcttgagtttgttacatctgctgggattacacatggcatggatgaactatacaaaagcttc 
tag 



protein: 

MGKGEELFTGVVPILVELDGDVNGQKFSVRGEGEGDATYGKL^ 

HDFLKSAMPEGYVQERTI FFKDDGNYKTRAEVKFEGDTLVNRI ELKGIDFKEDGNI LGHKLE YNYNSHNVYIMADKQKNG 
IKANFKTRHNIEDGGVQLADHYQQNTPIGTC 
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Nucleic acid alignment: 



Figure 4 (page 1 of 3) 

Nucleic acid alignment: 

. . . .10 . - . .20 . . . .30 . . . .40 . . . .50 . . . .60 

A . vict . GFP 1 : ATGaGTAAAGGAGAAGAACTTTTCACTGGAGTTGTCCCAATTCTTGTT : 4 8 

MMGFP5 1 : GAATTCX3CCACGATGGGTAAAGGAGAAGAACTTTTC^ : 6 0 

MMYFP5 1 : GAATTCX3CCACCATGGGTAAAGGAGAAGAACTTTTCACTGG : 6 0 

consensus l:************!!l*!!!!!!!!l!!!!!!!!!!I!!!!!!!!!!!!M!!!!M!!!!: 60 

. . . .70 . . . .80 . . . .90 . . . 100 . . . 110 . . . 120 
A. vict - GFP 4 9 : GAATTAGATGGTGATGTTAATGGGCAcAAA.TTtTCTGTCAGtGGAGAGGGTGAAGGTGAT : 1 0 8 
MMGFP5 6 1 : GAATTAGATGGTGATGTTAATGGGCAAAAATTCTCTGTCAGGGGAGAGGGTGAAGGTGAT :120 

MMYFP5 6 1 : GAATTAGATGGTGATGTTAATGGGCAAAAATTCTCTGTCAGGGGAGAGGGTGAAGGTGAT : 12 0 

consensus 61 :!!!!!!! 1 ! ! ! 1 ! ! ! ! I!!!!!!!!!*!!!! I MM!!!!!*!!!!!!!!!!!!!!!!!! : 120 

. . . 130 . . . 140 . . . 150 . . - 160 . . . 170 . . . 180 
A. vict .GFP 10 9: GCAACATACGGAAAACTTACCCTTAAATTTATTTGCACTACTGGaAAaCTACCTGTTCCa : 168 
MMGFP 5 121: GCAAC^TACGGAAAACTTACCCTTAAATTTATTTGCACTACTGGGAAGCTACCTGTTCCC : 1 8 0 

MMYFP5 121: GCAACATACGGAAAACTTACCCTTAAATTTATTTGCACTACTGGGAAGCTACCTGTTCCC : 1 8 0 

consensus 121: I!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!*!!*!!!!!!!!!! !*:180 

. 190 . . . 200 . . . 210 . . . 220 . . . 230 . . . 240 
A . vict . GFP 16 9: TGGCCAACACTTGTCACTACTTTC t CTTATGGTGT t CAATGCTT t TCAAG ATACCCAG AT : 22 8 
MMGFP 5 181: TGGCCAACACTTGTCACTACTTTGACTTATGGTGTACAATGCTTCTCAAGATACCCAGAT : 2 4 0 

MMYFP 5 181: TGGCCAACACTTGTCACTACTTTGACTTATGGTGTACAATGCTTCg CAAGATACCCAGAT : 2 4 0 

consensus 181: ! I!!!!!!!!!!!!!!!!!!!!!**!!! 1 !!!!!!*!! I !!!!!**!!!!!!!!!! ! ! ! ! :240 

. . . 250 . . . 260 . . . 270 . . . 280 . . . 290 . . . 300 
A . vict . GFP 22 9: CAT ATGAAa CaGCA t GACTT 1 1 TCAAG AG tGCCATGCCcGAaGG t TAtGTaCAGGAaAGa : 2 8 8 
MMGFP 5 241: CATATGAAGCGGCACGACTTCCTCAAGAGCGCCATGCCTGAGGGATACGTGCAGGAGAGG : 3 0 0 

MMYFP5 241: CATATGAAGCGGCACG ACTTCCT C AAG AG CG CC ATGC CTG AGGG AT ACGTGGAGG AG AGG : 3 0 0 

consensus 241 MM!!!!!*!*!!!*!!!!!**!!*!!!!*! J ! I!!!!*!!*!!*!!*!!*!!!!!*!!*: 300 

. 310 . '. . 320 . . • 330 . . . 340 . . . 350 . . . 360 
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Figure 4 (page 2 of 3) 

A. vict .GFP 289 : ACtATaTTtTTCAAaGAtGACGGGAACTACAAGACACGTGCTGAAGTCAAGTTTGAaGGt : 348 

MMGFP5 3 01: ACCATCTTCTTCAAGGACGACGGGAACTACAAGACACGTGCTGAAGTCAAGTTTGAGGGA : 3 6 0 

MMYFP5 3 01: ACCATCTTCTTCAAGGACGACGGGAACTACAAGACACGTGCTGAAGTCAAGTTTGAGGGA : 3 6 0 

consensus 301: I!*!!*!!*!!!!!*!!*!!!!!!!!!!!!!!!!!!!!!!!*!!!!!!!!!!!!!!*! !*:360 



. . . 370 . . . 380 . . . 390 . . . 400 . . . 410 . . . 420 
A. vict . GFP 3 4 9: GAtACCCTtGTt AAtAGaATCGAGtTaAAaGGtATtGATTTt AAaGAaGAtGGAAACATt : 408 
MMGFP5 3 61: GACACCCTCGTCAACAGGATCXSAGCTTAAGGGAATCGATTTCA^ : 4 2 0 

MMYFP 5 361: GACACCCTCGTCAACAGGATCGAGCTTAAGGGAATCGATT^ : 4 2 0 

consensus 361: !!*!!!!!*!!*!!*!!*!!!!!!*!*!!*!!*!!*!!!!!*!!*!!*!!*!!!!!!! i*:420 



. . . 430 . . . 440 . - . 450 . . . 460 . . . 470 . . . 480 
A . vie t . GFP 4 09: CT t GGa CAGAAa TTGGAATACAACTAt AACTCa CACAAt GT AT ACATCATGGCaGACAAa : 4 6 8 
MMGFP 5 4 21: CTCGGCCACAAGTTGGAATACAACTACAACTCCCACAACGTATACA : 4 8 0 

MMYFP 5 4 21: CTCGGCCACAAGTTGGAATACAACTACAACTCCCACAACGTATACATGATGGCC^ : 4 8 0 

consensus 421: !!*!!*!!!!!*!!!!!1!M!HM*!!!!!*!!!!!*!!!!!!!!!!!!!!*!!!! !*:480 



. . . 490 . . . 500 . . . 510 . . . 520 . . . 530 . . . 540 
A. vict .GFP 4 69 :GkAAAGAAtGGaATCAAAGttAACTTC7*AaAt 

MMGFP 5 4 81: CAAAAGAACGGCATCAAAGCCAACTTCAAGACCCGCCAC : 5 4 0 

MMYFP5 4 81: CAAAAGAACGGCATCAAAGCCAACTTCAAGACCCGC^ : 5 4 0 

consensus 481: 1 !!!!!!!*!!*!! J i !!!**!!!! !!! I*!***!*!!!!!!!!*!!!!!*!!**!!!!*: 540 



. . . 550 . . . 560 . . . 570 . - . 580 . . . 590 . . . 600 
A. vict. GFP 529: CAACTaG CaG Ac CATTATCAACATy^TACTCCAATTGGCGATGGCCCTGTCCITTTACCA : 5 8 8 
MMGFP 5 541: CAACTCGCTG AT CATTATCAACAAAATACT CCAATTGGCG ATGGCCCTGTCCl w ri w rACCA : 6 0 0 

MMYFP5 541: CAACTCGCTGATCATTATCAACAAAATACTCCAATTGGCGATGGCCCTGTCCTTTTACCA : 6 0 0 

consensus 541 : I I !!!*!!*!!*!!!!!! I ! ! ! ! ! ! ! !!!!!!!!!!! 1 ! ! ! ! ! U !!!!!!!!!!!!!! ! :600 



610 . . . 620 . . . 630 . . . 640 . . . 650 . . . 660 
A. vict .GFP 5 89: GACAACCATTACCTGTCCACACAATCTGCCCTTTCgaAAGATCCCaACGaAAAGAGAGAc : 6 4 8 
MMGFP5 601: GACAACCATTACCTGTCCACACAATCTGCCCTTTCCCAAGATCCCCACGGAAAGAGAGAT : 6 6 0 

MMYFP 5 6 01: G ACAACCATT ACCTGTCC tat CAATCTGCCCTTTCCCAAGATCCCCACGGAAAG AG AGAT : 6 6 0 

consensus 601 :!!!!!!!!!!!!•!!!!!***! ! ! ! M U !!!!!!**!!!!!! M *!!!*!!!•!!!!!* : 660 
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Figure 4 (page 3 of 3) 



. . . 670 . . . 680 . . . 690 . . - 700 . . . 710 . . . 720 
A.vict.GFP 649: CACATGGTCCITCTTGAGTTTGTaACAgCrrGCTGGGATTACACATGGCATGGATGAACTA : 70 8 
MMGFP5 661 : CACATGGTCCTTCI^GAGTTTGTTACATCTGCTGGGATTACACATGGCATGGATGAACTA: 720 

MMYFP5 661 : CACATGGTCCTTCTTGAGTTTGTTACATCT 

consensus 661: !!!!!!!!!! M !!!!!!!!!!!*!!!* 1 !! ! i !!!!*!! i MM!!!!!! ! i ! ! ! ! ! ! i : 720 



. . . 730 . . . 

A.vict.GFP 709 :TACAAAtaa :717 

MMGFP 5 721: T ACAAAAGCTTCTAGA : 7 3 6 

MMYFP 5 721: T ACAAAAGCTTCTAGA : 7 3 6 

consensus 721: ! ! ! ! ! !★********♦: 736 
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Figure 5 

protein alignment: 



A.vict. l:MsKGEELFTGWPILVELDGDVNGhKFSVsGEGEGDATYGKLTLKFICTTGKLPVPWPTL: 60 

mmGFPS 1 : MGKGEELFTGWPILVELDGDVNGK3KFSVRGEGEGDATYGKLTLKFICTTGKLPVPWPTL : 60 

mmYFPS 1 : MGKGEELFTGWPILVELDGDVNGQKFSVRGEGEGDATYGKLTLKFI CTTGKLPVPWPTL : 6 0 

consensus 1: !*!!!!!!!!!!!!!!!!!!!!!!*!!!!*!!!!!!!!!!!!!!!!!!!!!! n ! i . ! ; j . 60 



. . . .70 . . . .80 . . . .90 . . . 100 . . . 110 . . . 120 
A . vie t . 61: VTTf s YGVQCFSRYPDHMKqHDFf KS AMPEGYVQERTI FFKDDGNYKTRAEVKFEGDTLV : 12 0 

mmGFP5 61 : VTTLTYGVQCFSRYPDHMKRHDFLKSAMPEGYVQERTI FFKDDGNYKTRAEVKFEGDTLV : 120 

mmYFPS 6 1 : VTTLTYGVQCFaRYPDHMKRHDFLKS AMPEGYVQERTI FFKDDGNYKTRAEVKFEGDTLV : 12 0 

consensus 61 :!!!**!!!!!!*!!! 1 i !!*!!!*!!!!!! J !!!!! 1 !!!!!!!!!!! ! ! ! ! ! ! ! i iin . X20 



. . . 130 . . . 140 . . . 150 . . . 160 . . . 170 . . . 180 
A . vie t . 121: NRIELKGIDFKEDGNILGHKLE YNYNSHNVYIMADKQKNGI KvNFKiRHNIEDGs VQLAD : 1 8 0 
mmGFPS 121 :NRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKANFKTR 180 

mmYFPS 121 :NRIELKGIDFKEDGNII^HKLEYNYNSHNVYIMADKQKNGIKANFKT 

consensus 121:!!!!!!!!!!i!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!*!!!*!!!!!!!*!!!!!:180 



. . . 190 . . . 200 . . . 210 - . . 220 . . . 230 . . . 240 
A.vict . 181 : HYCXJNTPIGDGPVLLPDNHYLSTQSALSkDPneKRDHMVLLEFVTaAGITHGMDELYKz . :238 
mmGFPS 181 : HYQQNTPIGDGPVLLPDNHYLSTQSALSQDPHGKRDHMVLLEFVTSAGITHGMDELYKSF : 240 

mmYFPS 181 : HYQQNTPIGDGPVLLPDNHYLSyQSALSQDPHGKRDHMVLLEFVTSAGITHGMDELYKSF : 24 0 

consensus 181 :!!!!!!! i I !!!!!!!!! 1 I !!*!!! i !*!!**!! U !!! I !!!!*!!! I !!!!!! 1 ! ** :240 
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<110> Bastiaens, Philippe 
Pepperkok, Rainer 
Geley, Stefan 

Imperial College Research Technology Limited 
<120> Fluorescent protein 



<130> IMPWP21223 



<140> 
<141> 



<150> GB 9817225.7 
<151> 1998-08-08 



<160> 6 



<170> Patentln Ver. 2.0 



<210> 1 
<211> 722 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : Mutant GFP 



<400> 1 

atgggtaaag gagaagaact tttcactgga gttgtcccaa ttcttgttga attagatggt 60 
gatgttaatg ggcaaaaatt ctctgtcagg ggagagggtg aaggtgatgc aacatacgga 120 
aaacttaccc ttaaatttat ttgcactact gggaagctac tgttccctgg ccaacacttg 180 
tcactacttt gacttatggt gtacaatgct tcgcaagata cccagatcat atgaagcggc 240 
acgacttcct caagagcgcc atgcctgagg gatacgtgca ggagaggacc atcttcttca 300 
aggacgacgg gaactacaag acacgtgctg aagtcaagtt tgagggagac accctcgtca 360 
acaggatcga gcttaaggga atcgatttca aggaggacgg aaacatcctc ggccacaagt 420 
tggaatacaa ctacaactcc cacaacgtat acatcatggc cgacaagcaa aagaacggca 480 
tcaaagccaa cttcaagacc cgccacaaca tcgaagacgg cggcgtgcaa ctcgctgatc 540 
attatcaaca aaatactcca attggcgatg gccctgtcct tttaccagac aaccattacc 600 
tgtcctatca atctgccctt tcccaagatc cccacggaaa gagagatcac atggtccttc 660 
ttgagtttgt tacatctgct gggattacac atggcatgga tgaactatac aaaagcttct 720 
ag 722 



<210> 2 
<211> 240 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : Mutant GFP 
<400> 2 

Met Gly Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
1 5 -10 15 

Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Arg Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
50 55 60 

Thr Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys Arg 
65 70 75 80 

His Asp Phe Leu Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly He 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

He Lys Ala Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Gly Val 
165 170 175 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu Ser 
195 200 205 

Gin Asp Pro His Gly Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 



Thr Ser Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys Ser Phe 
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225 230 235 240 



<210> 3 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR primer 
<400> 3 

atgcggccgc gaattcgcca ccatgggtaa aggagaagaa ctt 

<210> 4 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 4 

ctgggtatct tgcgaagcat tgtacgtaca atgcttcgca agatacccag 

<210> 5 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 5 

gaaagggcag attgatagga caggtaatgc attacctgtc ctataatctg ccctttc 

<210> 6 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : PCR primer 



<400> 6 
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aaaaggatcc tctagaagct tttgtatagt tcatccatg 39 
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